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We use a recently discovered constrained de Finetti reduction (aka "Post-Selection 
Lemma") to study the parallel repetition of multi-player non-local games under no¬ 
signalling strategies. Since the technique allows us to reduce general strategies to inde¬ 
pendent plays, we obtain parallel repetition (corresponding to winning all rounds) in the 
same way as exponential concentration of the probability to win a fraction larger than the 
value of the game. 

Our proof technique leads us naturally to a relaxation of no-signalling (NS) strategies, 
which we dub sub-no-signalling (SNOS). While for two players the two concepts coincide, 
they differ for three or more players. Our results are most complete and satisfying for 
arbifrary number of sub-no-signalling players, where we get universal parallel repetition 
and concentration for any game, while the no-signalling case is obtained as a corollary, but 
only for games wifh "full support". 


I. NON-LOCAL MULTI-PLAYER GAMES AND THEIR PARALLEL REPETITION: SHORT REVIEW 

OE PREVIOUS APPROACHES AND OURS 


A multi-player non-local game is played between cooperating but non-communicating play¬ 
ers. Each player receives an input from some input alphabet and has to produce an output in 
some output alphabet. The common goal of the players is to satisfy some pre-defined predicate 
on their inputs and outputs. For that, they may agree on a strategy before the game starts, but 
are then not allowed to communicate anymore. Such games are especially relevant in theoretical 
physics in the context of the foundations of quantum mechanics and quantum information, and 
in computer science where they arise in multi-prover interactive proof systems. Indeed, they may 
provide an intuitive and quantitative understanding of the role played by various degrees of cor¬ 
relations in global systems which are composed of several local subsystems. These games also 
arise in complexity theory, under the formulation of multi-provers with some shared resources 
producing a protocol that should convince a referee, or in cryptography as attacks from malicious 
parties having a more or less restricted physical power. 

The value of a game is the maximum winning probability of the players, over all allowed joint 
strategies, using possibly some prescribed correlation resource such as shared randomness, quan¬ 
tum entanglement or no-signalling correlations. It has been a subject of considerable study how 
the availability of different resources affects the values of certain games 

In this context, a natural question is how the value of a game behaves when n independent 
instances of the game are played simultaneously, i.e. each player gets n independent inputs and 
has to provide n outputs such that each game instance is won (or a large fraction of them). This 
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is the parallel repetition problem. Playing independently the optimal single-game strategy on all 
n game instances will result in an exponentially decreasing winning probability. But although 


that was found paradoxical at first, this is in general not optimal fig. 11911 . For classical two-player 
games, Raz lIsA . later simplified and improved by Holenstein 12^, established the first general 
parallel repetition theorem, showing that the value of n repetitions decreases exponentially for 
every game. Holenstein 12311 also proved an analogous parallel repetition theorem for the no¬ 
signalling value of general two-player games. Only recently, parallel repetition theorems were 
proved for the entangled value of two-player games: for general games, nothing better than a 
polynomial decay result is known up to now (this was intially proved for a slightly modified game 
bgtl , and very recently only for the game itself IH]), while exponential decay results have been 
established in several special cases (perfect parallel repetition for XOR games lUil . exponential 
decrease under parallel repetition for unique games li^ . projection games Q, free games §0]). 

Even less is known concerning multi-player games. And apart from 0 (containing both 
classical and quantum statements), results were obtained only in the no-signalling setting 


351] . The present work has the same focus on multiple no-signalling players, albeit we will find 


that the theory becomes much more satisfying for sub-no-signalling players. 

Before getting into more precise and technical statements, let us give a high-level exposition of 
the philosophy of the present work, and especially how it compares to or differs from previous 
approaches. The standard proof technique to tackle parallel repetition (in either the classical, the 
quantum or the no-signalling case) consists in iteratively assuming that the players have won a 
given instance of the game and then studying how this affects their winning probability in the 
others. Hence, if one can show that, conditioned on the event "the players have already won 
k instances of the game", the probability is high that they lose in at least 1, resp. most, of the 
n — k remaining instances, one gets exponential decay of the probability of winning all, resp. a 
fraction above the game value of, n instances of the game played in parallel. The main drawback 
of this approach is probably its "locality", which makes it not so straightforward and not so easily 
generalizable to more than two players. Here, we take a more "global" look at the problem, by 
attempting to reduce the study of such n-instance game to that of n i.i.d. 1-instance games, whose 
analysis is trivial. This is where de Finetti type statements come into play: using the fact that the 
repeated game is symmetric under permutation of its parallel rounds, these allow relating it in 
some way to independent rounds. 

However, there are certain steps from the standard route which we do not avoid in our ap¬ 
proach. One of them is some kind of reconstruction step. Phrased informally: we have to be able 
to say at some point that, if our strategy almost satisfies the constraints defining our set of interest, 
then there must exist a strategy which exactly satisfies them and which is not too far away from it. 
Nonetheless, it is not always so easy to get handleable quantitative versions of this quite natural 
expectancy. This is the main reason why the set of sub-no-signalling strategies that we introduce 
is such a nice one for studying parallel repetition. Indeed, we prove that if a strategy satisfies 
all the sub-no-signalling constraints, up to some error e, then it is Ce-close to the set of sub-no- 
signalling strategies, with a constant C which depends only on the number of players. And this 
fact ultimately translates into a universal exponential decay statement for the sub-no-signalling 
value of repeated multi-player games. This kind of stability property actually also holds for the 
set of two-player no-signalling strategies, which was discovered and used by Holenstein to prove 
universal parallel repetition in that case 12311 . Oppositely, it remains unknown whether this is still 
true for three or more players, which explains why all parallel repetition results for strictly more 
than two no-signalling players are game-dependent ones |0,U. Viewing the no-signalling setting 
inside our broader sub-no-signalling framework, we are also able to reproduce these earlier find¬ 
ings. One notable advantage of our approach is that it is particularly well-suited to studying the 
concentration problem as well, and once exponential decay of the probability of winning all game 
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instances is established, exponential decay of winning a too high fraction of them comes almost 
for free. 


II. NON-LOCAL MULTI-PLAYER GAMES AND (SUB-)NO-SIGNALLING STRATEGIES: 
DEEINITIONS AND EIRST OBSERVATIONS 

Specifically, we will consider here ^-player games G with input alphabets ... and out¬ 
put alphabets Ai,..., Ai- By way of notation, 

t i 

^ := Xi and ^ := X A- 

i=l i=\ 

Furthermore, for any subset I C [i] of indices. 


Xj := X •= X A- 

iGl iGl 

An element from Xi,Xj,^ will usually be denoted by Xi^xi^x, respectively, sometimes without 
explicitly specifying the set it belongs to (and similarly for Ai,Ai, A). 

Also, for any I, J C [1], given T a probability distribution (which we may quite often abbreviate 
by "p.d.") on Xj, resp. P a conditional probability distribution on Aj\Xj, we may denote it by 
Txj, resp. Paj\Xi, when confusion on the considered alphabets is at risk. 

From now on, we will be interested in making minimal a priori assumptions on how powerful 
the £ players may be. This will naturally lead us to considering that their common strategy to win 
the game G could be any no-signalling (or even sub-no-signalling) strategy, which we define now. 

Definition 1 The sets of no-signalling and sub-no-signalling correlations, denoted respectively NS{A\^) 
and SN0S(A|£T), consist of non-negative densities P{g]x) > 0 defined as follows: 

P G NS(A|<T) V / C [£], 3 Q{-\xi) p.d.'s on Ai s.t. V x, aj, P{ai\x) = Q{ai\xi), (1) 

P G SNOS{A\^) V / C [£], 3 Q{-\xi) p.d.'s on Aj s.t. V x, aj, P{ai\x) < Q{ai\xi). (2) 
Here, P{ai\x) denotes the marginal density, 

P{ai\x) = P{a = ajai^lx). 

CLjc Gw4jc 

Remark Note that under this definition, NS(A|^) C SNOS(A| A), but the latter is a strictly larger 
set (e.g. it always contains the all-zero density). Furthermore, P G NS(A|fT) iff P G SNOS(A|fT) 
and P is normalized in the sense that for all x G Xa = 1- Indeed, NS consists of condi¬ 

tional probability distributions, while SNOS allows, given each input, a total "probability" of less 
than or equal to 1. 

Also, it can be shown that in equation ([T]), only sets of the form I = [£] \ i need to be considered. 
This is because the no-signalling conditions take the form of equations and this subset spans the 
set of all equations required (cf. Lemma 2.7). The analogous statement for sub-no-signalling 
is not known and likely false. Nevertheless, one might in other contexts consider to relax the 
conditions of equation (|2]) to hold only for a selected family of subsets I c [£]. □ 

An f'-player game G is characterized by a probability distribution T(x) on the queries Xj and 
a binary predicate V{a,x) G {0,1} on the answers and queries A x as illustrated in Figure [TJ 


4 


The no-signalling, resp. sub-no-signalling, value of the game, denoted a;Ns(G), resp. a;sNOs(G'), is 
the maximum of the winning probability 

P(win) =E1/(A,X) = r(x)y(a, x)P(a|x) 

a,x 

over all P G NS(^|£T), resp. P G SNOS(^|^), where the distribution of X = Xi .. .X^ and 
A = Ai... A^is as expected, 

V X, a, P (X = X, A = a) = T{x)P{a\x). 

In words, the (sub-)no-signalling value of a game is the maximal probability of winning it 
when no limitation is assumed on the power of the players, apart from the fact that they cannot 
signal information instantaneously from one another. In the sub-no-signalling case, constraints 
are relaxed even more: players are not forced to always produce an output, and it is only required 
that their strategy "looks as if if were no-signalling" (even though they may have "hidden" in their 
abstentions the fact that it is signalling). In Section IVll we extend on the physical interpretation of 
sub-no-signalling, and briefly discuss other kinds of restrictions that one may put on the players' 
physical power, such as shared randomness or shared quantum entanglement only. 


FIG. 1. An f-player non local game 
Xi G Xi xi G w.p. T{xi... xi) 






Player 1 


Player £ 






(X\ € Ai ai G Ae w.p. P(ai... at\xi.. .xg) 


The players win iff V{ai... ae, xi... xi) = 1 


A. Two-player SNOS = NS 

Not surprisingly, the no-signalling and sub-no-signalling values of games are related. We start 
by showing that for any two-player game G, they are identical, i.e. (Uns(G) = wsnos(G’)- As 
NS C SNOS, the inequality "<" is evident, and we only need to prove the opposite inequality 
">". This follows from the following structural lemma. 

Lemma 2 (cf. lHl ) Let P G SNOS(A x B\X x y) be a two-player sub-no-signalling correlation. Then 
there exists a no-signalling correlation P' G NS{A x B\X x y) with P < P' pointwise, i.e. P{ah\xy) < 
P'{ab\xy) for all a, 6 , x, y. 

Since playing a game G with a strategy P necessarily yields a smaller value than playing it 
with a strategy P' which dominates P pointwise, it is clear that once Lemma |2] is proved we just 
have to apply it to P an optimal SNOS strategy for G to get the inequality ">". 
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Proof If P is normalized, i.e. if for all x, y, J2ab = 1, there is nothing to prove because P 

is already no-signalling. 

Otherwise, there exist x, y with weight ^ sub-no-signalling assump¬ 

tion, we have distributions Q{a\x) and Q(b\y) dominating the marginals: 

V a, b, P{a\xy) < Q{a\x), P{b\xy) < Q{b\y). 

As the total weight of both marginals of P{-\xy) is u) < 1, we can find a and b such that 

Pia\xy) < Q{a\x), P{b\xy) < Q{b\y), 

so we can increase P{ah\xy) by some e > 0 to P'{ab\xy) = P{ah\xy) + e and still satisfy the sub- 
no-signalling conditions. By choosing e maximally so, we can reduce the total number of strict 
inequality signs in the SNOS conditions. Iterating this procedure we arrive at a sub-no-signalling 
correlation P' with all inequalities met with equality, i.e. a no-signalling correlation. 

Another presentation of this argument appeals to compactness. Consider the following set of 
correlations: 

Xp,Q := [P' : a,b,x,y, P'{ab\xy) > P{ab\xy), P\a\x) < Q{a\x), P'{b\y) < Q{b\y)] . 
being compact and P' i-> Ylxy Ylab P'{ab\xy) being continuous, 

sup I ^ ^ P'{ab\xy) : P' G Xp,Q 

I xy ab 

is actually attained. If it were less than | A x y\, we could use the procedure above to increase the 
objective function, contradicting that it is a maximum. □ 

Note that the "bumping up" procedure described above, in order to transform any two-player 
sub-no-signalling strategy into a no-signalling one dominating it pointwise, may fail for more 
players. The two-player case is indeed special, due to non-overlapping of the two SNOS or 
NS constraints. However, already in the case of three players, even just the three inequalities 
PAiAj\XiXjXi, < QAiAj\XiXj uiay be impossible to bring simultaneously to equalities by pointwise 
increment (as illustrated by Example lllBI belowl. 


B. Multi-player SNOS vs NS 


Clearly, WNsjC) < wsnos(G') for every game, and there are examples of games (with game 
distribution T having strictly smaller than full support) where uj-^s{G) < 1 but a;sNos(G') = 1, for 
instance the anticorrelation game. 

Example (cf. ||2], Appendix A) Consider the three-player anti-correlation game A 3 , which has binary 
input and output for all players and game distribution T supported on {0,1}^ \ {HI}, i.e. Ill 
does not occur as a triple of questions. The winning predicate is that if any two inputs are 1, say 
Xi = Xj = 1, then the corresponding outputs must be different, a* / aj. While if there are zero or 
only a single 1 amongst the inputs, outputs may be arbitrary. 

It is straightforward to verify that the following correlation is in SNOS ({0,1}^|{0,1}^) and 
wins the game with certainty: 


P{aia2a3\xiX2X3) 


0 if X 1 X 2 X 3 = 111 , 

<1/8 if 3 1 < i 7 ^ j < 3 : Xj = Xj = 0, 

if 3 1 < i 7 ^ j < 3 : Xi = Xj = I and X 1 X 2 X 3 7 ^ 111. 
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So u;sNOs(^ 3 ) = 1- On the other hand, for, say, T uniform on {Oil, 101,110}, one can check by 
elementary means that wns (d- 3 ) = 2/3. □ 

What happens in the above example is that it is possible to satisfy any two amongst the three 
no-signalling constraints, but not the three of them at the same time. This is a phenomenon 
sometimes referred to as "frustration". 

However, for a game distribution T having full support, a simple reasoning shows that 

<^ns(G) < 1 implies u;snos(G') < 1- Indeed, we show the contrapositive, assuming that tasNOs(G’) = 

1. Because of the full support of T, this implies that for the optimal sub-no-signalling strategy P 
and every x. 


1 = x)P{a\x) < ''^^P{a\x) < 1, 

a a 


hence equality (i.e. normalization) holds for all x. Thus, P is really a no-signalling correlation 
and so a;Ns(G') = 1. In fact, we can show something stronger, namely the following quantitative 
relationship. 


Lemma 3 Consider a game distribution T with full support on Then there exists T = r(r) > 0, 
which only depends on T, such that for every game G with query distribution T, 

‘^snos{G) > 1 — e ^ u}]sis{G) > 1 — (T -I- l)e. 

The definition o/T can be taken from or ii], where it is implicitly defined as some robustness parameter 
of the linear program whose optimal value is wns{G). 

Proof Take an optimal strategy P € SNOS(^|^), so that P{ai\x) < Q(o/|x/) for all I, a/, x. Then, 
^T(x)P(a|x) > ^ T(x)l/(a, x)P(a|x) = a;sNOs(G') > 1 — e. 

a,x a^x 

And so we get, for all I, 

\\^xPai\x-TxQai\Xi\\i = '^T{x){Q{ai\xi) - P{ai\^) < e, 

ai,x 

because the difference term in the sum is non-negative. 

Now simply "bump up" the sub-normalized probability distribution Pa\x to a properly nor¬ 
malized conditional probability distribution adding at most an averaged weight over of 
e, and hence, for all I, 


2\\'^kPai\x-TxQai\Xi\\^ < e - 

At this point we can invoke the stability of linear programs, used in |@] and 10] to conclude that 
there is T = r(r) > 0 such that there is a no-signalling correlation P'f]^p^ £ NS(A| A) with 


\'^K^A\x 
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This gives 


^ns{G) T{x)V{a,x)P"{a\x) 

a,x 

> ''^^T{x)V{a,x)P'{a\x) — Te 

a,x 

> ''^T{x)V{a,x)P{a\x) — Te 

a,x 

> 1 — (r + l)e, 

where we have used the total variational bound on P" — P', the fact that P' dominates P and the 
assumption on the probability of winning G when played P. □ 

The rest of the paper is structured as follows: In Section |III] we introduce parallel repetition 
of games, and state our main results, which improve upon, and partly clarify, earlier findings by 
Holenstein iIt^ . Buhrman et al. |@] and Arnon-Friedman et al. y]. In Section HVl we present the 
main technical tool, one of the constrained de Finetti reductions from fsotl . adapted to our present 
needs, followed by the proofs of the main theorems and corollaries in Section 0 We conclude in 
Section rvn 


III. PARALLEL REPETITION: DEFINITIONS AND MAIN RESULTS 

Given an f-player game G, with probability distribution T{x) on A and binary predicate 
V{a,x) E {0,1} on A X A, we are interested in playing the same game n times independently 
in parallel, and in looking at the probability of wirming all n or a subset of t of them. 

Formally, the n-fold parallel repetition of G is the ^-player game G^ having the product probabil¬ 
ity distribution on 

= r(x(^))--•r(a: 

and the product binary predicate on A” x A** 

E {0,1}. 

The no-signalling, resp. sub-no-signalling, value of this n-fold parallel repetition game, denoted 
a;Ns(G”), resp. ^snosCG"^), is thus the maximum of the winning probability 

P(win)= ^ T®'^{x^)V®^{a^,x^)P{a^\x^) 

,x^ 

over all P E NS(A”| 5 T"), resp. P E SNOS(A”| A^). 

In words, the players win G"^ if they win all n instances of G played in parallel. So we obviously 
always have (for the allowed set of strategies being X E {NS, SNOS}) 

{uJxiG)y <uJxiG^)<uJxiG). (3) 

Flowever, in the case where ujx{G) < 1, the gap between the lower and upper bounds in equa¬ 
tion lO grows exponentially with n, making equation ll3]l very little informative. The parallel 
repetition problem is thus the following: If none of the players' allowed strategies can make them 
win 1 instance of G with probability 1, does it necessarily imply that they have an exponentially 
decaying probability of wirming n of them at the same time? And if so at which rate? 
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More generally, we can study the game whose winning predicate is defined as winning 
any t (or more) out of n repetitions i.e. 

:= > t 

U=i 

Note that, with our notation, = G”/”. 

The main results of the present paper are gathered below, where we set G^ := 2^+^ — 3. 

Theorem 4 (Parallel repetition of sub-no-signalling Gplayer games) Let G he an i-player game 
such that uJsNOsiG) < 1 — Sfor some 0 < (5 < 1. Then, for any n E N, and any t > (1 — 5 + a)nfor some 
0 < a < 6, we have 


10 otherwise. 


/ 52 X" 

wsnos(G") ^ (^f “ ’ 

t^SNOs(G*/”) < exp . 

As immediate consequences or refinements of TheoremHl we can get parallel repetition results 
for the no-signalling value of multiplayer games in some particular instances. 


Corollary 5 (Parallel repetition of no-signalling full support Gplayer games, cf. Q,@]) Let G be 

an i-player game whose distribution T has full support, and such that a;]vs(G) < 1 — 5 for some 0 < 5 < 1. 
Then, for any n € N, and any t > {1 — 6 + a)nfor some 0 < a < d, we have 


U^NsiG^ < 1 - 


u^NsiG^h < 


5 G|(r + 1)2 


a 


exp —n 


^5G2(r + i)2 


where T = r(T) > 0 is the constant from Lemma\^ which only depends on T. 

Note that the constant T in this corollary depends on the game, and in the worst case carries 
a heavy dependence on the players' alphabet sizes. This is in contrast to Holenstein's two-player 
result for no-signalling games, which has no alphabet dependence at all l23ll . This is generalized 
in our Theorem HI since for two players we know by Lemma |2] that NS = SNOS, and we could 
directly read off bounds with constants already improving on Holenstein's. Looking a little into 
the proof allows us to optimize the constants even more, which we record as follows. 


Theorem 6 (Parallel repetition of no-signalling 2-player games, cf. i23l ) Let G be a 2-player game 
with (jj-fsisiG) < 1 — S for some 0 < 5 < 1. Then, for any n E N, and any t > (1 — (5 -|- a)nfor some 
0 < a < 6, we have 


u;jvs(G") < (^1 --j , 
wns(G'‘/’") < exp • 
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IV. CONSTRAINED DE FINETTI REDUCTION 

De Finetti reductions are a useful tool when trying to understand any permutation-invariant 
information processing task. Indeed, these enable to restrict the analysis to that of i.i.d. scenarios, 
which are usually trivially understood. In the context of multi-player games played n times in 
parallel, one would like to use the fact that the numbering of the n instances of the repeated 
game is irrelevant to reduce the study of strafegies for the latter to the study of so-called de Finetti 
strategies (i.e. convex combinations of n i.i.d. sfrategies). 

The seminal de Finetti reduction (aka post-selection) lemma was stated in ilflll . later find- 
irw applications in many areas of quantum information theory, from quantum cryptography 
13111 to quantum Shannon theory [3. Our proofs though, will rely on two more recently es¬ 
tablished de Finetti reduction results, which are stated below. Just to fix some definitions: we 
will say that a (sub-)probability distribution Pz^, resp. a conditional (sub-)probability distribu¬ 
tion Pjgn\yn, is n-symmetric if for any permutation tt of n elements, V z^, P{7r{z'^)) = P(z’^), 
resp. V P(7r(6”')|7r(?/”)) = P(6"’|y"’). 

Lemma 7 (de Finetti reduction for conditional p.d.'s, iQ]) Let B, y be finite alphabets. There exists 
a probability measure on the set of conditional probability distributions Rs\y such that, for any 

n-symmetric conditional probability distribution P^n\yn, 



where the polynomial pre-factor may he upper bounded as poly(n) < {n-\- 

Lemma 8 (Constrained de Finetti reduction for (sub-)p.d.'s, ijs^) Let Z be a finite alphabet. There 
exists a probability measure dQz on the set of probability distributions Qz on Z such that, for any n- 
symmetric (sub-)probability distribution Pz^ on Z^, 

Pzr^ < poly(n) [ F {Pz-, Qf^ dQ z, 

JQz 

where the polynomial pre-factor may be upper bounded as poly(n) < {n-\- 

In Lemma |8] above, as well as in the remainder of this paper, F{P,Q) stands for the fidelity 
between probability distributions P and Q, defined as F{P, Q) = \\^fP^/Q\\l. 

We are now ready to present the technical lemma that will allow us in Section |V] to reduce 
the study of strategies for repeated games to the study of so-called de Finetti strategies, and hence 
prove our main results. 

Lemma 9 (de Finetti reduction for sub-no-signalling correlations) There exists a probability mea¬ 
sure dQ on the set of probability distributions Q on Ax ^ such that for any probability distribution T on 
X_and any P G SNOS^A^lAT) an n-symmetric sub-no-signalling correlation, it holds that 

< poly(n) [ F{Q^f^Q%dQ^, (4) 

JQ ax 

where we defined 

F {Qm) := min max F {TxRaj\Xj,QAjx) ■ 

Raj\xj 

We mention for the sake of completeness that the poly(n) pre-factor in equation dU) may be upper bounded 

^,y(j^+l)3|A|2|^|2+2|^||^|^ 
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Proof Since is an n-symmetric sub-probability distribution on (AX) '^, we first of all 

have by Lemma that 

TTPa-\x- < poly(n) [ F (tTPa-\x-,Q%Y QTx ^QaK- 

- JQax ^ ^ — 


Notice next that, for any 0 7 ^ / ^ [f”]. 


F 



Pa^\x^ 


Q 


®n 


AX - 


<F{T' 


<f(^ 


rp<S>n r)f 



The first inequality is by monotonicity of the fidelity under stochastic maps (in particular taking 
marginals). While the second inequality is because P G SNOS(^"^|£T"^), so that < P'a^ix” 

for some conditional p.d. P'jcn^p^n, and because the fidelity is order-preserving. 

What is more, for any 0 7 ^/ ^ [^], P'jjji^^^ can be chosen to be an n-symmetric conditional 
probability distribution. Indeed, if it were not, its n-symmetrization would still upper bound 
Pa^A"^ (since the latter is by assumption n-symmetric). We then have by Lemma [71 that 


— Poly(^) / P\\xAPai\Xi-, 

R A i\Xj 

and subsequently, using first, once more, that the fidelity is order-preserving, and second that it 
is multiplicative on tensor products. 


77 (rp®n p/ 

^ ^A^\xy 



< poly(n) max F 

RAj\Xj 


(rp^n p®n 
[^X ^Ai\Xi 



= poly(n) max F {TxRai\Xi,Qaix)'^ ■ 

RXj\Xj 


Recapitulating, we get 


rp®n P 

-^x ^ 


V|V" < poly(n) [ ( min max F {TxRai\Xi,Qaix) ] Qax ^Qax, 

JQax y^i&]RAj\Xj J — 


as announced. 


□ 


V. PROOFS OF THE MAIN THEOREMS 

In this section we prove Theorem jH Corollary |5| and Theorem 0 
We need first of all the following extension of Lemma 9.5 in 1231] : 

Lemma 10 For Z = Xj^i Zj and B = XjZi Bj, consider probability distributions T on ^ and P on 
Bx^ satisfying 

\\\PA-Tdi^^o- (5) 

If for each j G [m] there exists a conditional probability distribution Q{bj\zj) such that 

2 11 - TzQsj |.zj 1 < Cp 


( 6 ) 
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P'ikll) such that, for each j G [m], P'{bj\z) = 

m 

< eo + ^ 2ej. (7) 

i=i 

Proof This works exactly as the proofs of the case m = 2, appearing as Lemma 9.5 in it^ . or of 
the case m = 3, appearing as Lemma 5.4 in iB^ . Both statements follow from applying either two 
or three times Lemma 9.4 in j^ . Let us state the latter for completeness, and then only sketch 
how the proofs of the cases m = 2 or m = 3 generalize to any m. 

Holenstein (|0|, Lemma 9.4): Let Psj- and be probability distributions over SxT 
and S respectively. There exists a probability distribution Rsj- over SxT such that 

||f?5r - PstWi - 11^5 “ -P<s||i and \\Rs - QsHi = \\Pt - Pt\\i = 0- 

Thanks to this result, we know that we can recursively construct a sequence ... ^pT) of 
probability distributions on 5 x Z such that, setting = Q, for each j G [m], we have: for any 
fixed z G ^ 

11 ~ ^B\Z=z 111 - II \Z=z - QbjI Zj =Zj\\i, 

l|■^i^|^=^ “ ^Bq2j=Zj||i = 0) 

V A: G [m\ \ {j}, — QSfcl:Z.=:z|li ~ 

The probability distribution pT) then satisfies 

m 

llTkPifi - Pszl < WPz-Td, +T'^\\Pb,z - T2Qe,l2,\\v 

1 = 1 

and can therefore be chosen as the desired P'. □ 

We just mention as a side note that Lemma 9.4 in crucially relies on the following fact: the 
statistical distance between two probability distributions Pi, P 2 , i.e. 

>-p*. 

can be equivalently characterized as the minimum probability that Xi differs from X 2 over pairs 
of random variables (Xi, X 2 ) sampled from P having (Pi, P 2 ) as marginals. 

Note that the conditions enforced in LemmalTOlare not enough to ensure no-signalling of P' for 
three or more players. They would be sufficient though to guarantee that P' satisfies the relaxed 
no-signalling constraints considered in jH], namely that any group of£—l players together cannot 
signal to the remaining player. In other words, if a correlation approximately satisfies the Markov 
chain conditions necessary for being no-signalling, in the form of equations (O and (O, then it is 
approximated, in the sense of equation (O, by a "weak" no-signalling correlation, as considered in 
l35l 1. Nevertheless, we can leverage this result to approximate the given no-signalling correlation 
by a sub-no-signalling correlation. 

Lemma 11 Let P be a probability distribution on Ax ^ and T be a probability distribution on X.- If Ihe 
no-signalling conditions O hold approximately, namely 

VIC [£], 3Q{-\xi)p.d.'sonAiS.t. ^\\PaiX-TxQai\Xi\\i < e/, 


then there exists a conditional probability distribution 
P' (bj I Zj) for all bj , ^ and 

2\\'^zPb\z- Pm.\\i 
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then there exists a sub-no-signalling correlation P' G SNOS{A\^) that approximates P, in the sense that 

2\\'^kPa\x-Pak\\i^^iIi+ 2 e /. 

0^/C[£] 

In the two-player case i = 2, P' can be chosen to be no-signalling itself, P' G NS(^|£^). 

Proof We will apply Lemma ITOl with m = 2^ — 2, the index j identifying a non-empty and non¬ 
full set 0 7 ^ / $1 [i’] (for instance via the expansion of j into i binary digits). The local input and 
output alphabets are 


z, = X 

i£l 


B, = X A, 

iei 


and the distribution we apply it to is 


P{bz) 


Pjax) if Vj, bj = {tti i G I), Zj = {xi '■ i G I), 

0 otherwise. 


Likewise, the prior distribution on Z_ is given by 

T{z) = 


T{x) Zj = {xi : i G I), 

0 otherwise, 


and we use the conditional distributions Q{bj\zj) = Q{ai\xi). 

Now, the prerequisites of Lemma [10] are given, with Cj = ej, and thus we get a conditional 
probability distribution P' with P'{bj\z) = P'{bj\zj) for all j, and 

1 ^ 

2\\^zP'b\z - P^Wi < eo + 

1=1 

We would like to conclude here by "pulling back" this conditional distribution to a correlation on 
A X Xj which we would wish to be no-signalling. This almost works, except that P' has support 
outside the image of the diagonal embedding 


a I—)• b s.t. Vj, bj 


[ai'.iG I), 


and likewise for A : X_ —> Z_- 

To resolve this issue, we simply remove this part of the distribution, and define the desired 
sub-normalized conditional densities by letting 

P'{a\x) := P'(A(a)|A(x)). 


From this we see directly that 

\\\^xP'a\x- PakWi - 

because P{b, z) = P(a, x) for b = A{gf and z = A(x), and it is 0 outside the image of A. 
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It remains to check that P' is sub-no-signalling. Let 0 / / C [£] be a subset with corresponding 
index l<j < 2 ^- 2 . Let also x £ ^ aj £ Aj he tuples, and set z = A(x), 6 = A (a) (so that 
Zj = xj £ Aj = Zj, bj = aj £ Ai = Bj). Then, 

P'(a/|x) = ^ 

djc Gv4jC 

= ^ P'(A(a)|A(2:)) 

djc Gv4/c 

< E 

-P'{bj\Zj) Q'{ai\xi). 

Here, we have used the definition of the marginal and of P'. The inequality in the third line 
is because we enlarge the domain of fhe summation, and the equality in the last line is by the 
marginal property of P'. 

The last claim, regarding £ = 2 players, is the original Lemma 9.5 in lEsll . □ 

We are now ready to prove our main theorem, namely the parallel repetition and concentration 
results for the sub-no-signalling value of multi-player games. 

Proof [Proof of Theorem jH Let Pj^-^\x”- be a sub-no-signalling correlation which is optimal to 
win the game G^. The distribution and the predicate of G^ being n-symmetric, we 
can assume without loss of generality that Pa'^\x^ is also n-symmetric. Indeed, since for any 
permutation vr of n elements, T o vr = T and V ott = V, playing G” with P or with P o tt yields the 
same winning probability. And therefore, if P is an optimal sfrategy then so is its symmetrization 
over all permutations of n elements. Hence, by Lemma |9l 

TTPa’^IK- < poiy(^) [ P {Qak?" 

•^Q ax 

Now, fix 0 < e < 1 and define 

Pe ■= \ Qax ■ max min 1:\\TxRai\Xj - QAix\\i < e 

Observe that, by well-known relations between fidelity and trace-distance (see e.g. IIS])/ if Qax ^ 
Pe, then automatically F (QaxY < I — e^. Hence, 

TTPa-\x- < poly(n) ( [ dQAx + (1 - dQ^ 

On the other hand, if E Ve, then by definition 

V 0 / I ^ [i\, 3 Rai\Xi ■ -^^IP^PaiIXi - QaixWi < e. 

By Lemma Ull the latter condition implies that there exists a sub-no-signalling correlation 
such that 

^WTxR'aix - QaxWi < Qe, where Q = I + 2(2^ - 2) = 2^+^ - 3. 
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Yet, the winning probability when playing G with a strategy G SNOS(^|^) is, by assump¬ 
tion on G, at most 1 — (5. So the average of the predicate of G over G is at most 1 — (5 -|- 2Qe. 

Putting everything together, we eventually get that the winning probability when playing G” 
with strategy is upper bounded as 


P(win) < poly(n) ((1 - 5 + 2GteY + (1 - e^)”) . 


Choosing in equation (|8]) 

e = G, 


' + 4 ) 


1/2 


- 1 > 


99(5 

200Gf 


, so that > 


5G|’ 


( 8 ) 


and recalling that is, by hypothesis, an optimal sub-no-signalling strategy, we obtain 

‘^SNOs(G'”) < poly(n) ^1 - . (9) 

In order to conclude, we have to remove the polynomial pre-factor. So assume that there exists 
a constant G > 0 such that for some iV G N, Ci;snos(G'^) > C (l — (5^/5G|)^. Then, for any n G N, 
we would have 

/ ^2 \ 

wsnos(G'^"') > (wsnos(G^)) > G” ^1 - 

On the other hand, however, we still have by equation (Q 

/ (52 

wsnos(G'^'') < poly(iVn) ^1 - ^ j 

Letting n grow, we see that the only option to make these two conditions compatible is to have 
G < 1, which is precisely what we wanted to show. 

Following the exact same lines as above, we also get the concentration bound. Indeed, for any 
t > {I — 6 + a)n, we now have in place of equation (|8]) that, for any 0 < e < 1, 

wsnos(G'‘^"') < poly(n) (exp [-2n(a - 2Gief] + exp [-ne^]) . (10) 

The first term in the r.h.s. of equation flOll is a consequence of Hoeffding's inequality: if A, 2L are 
distributed according to G Ve, then the value of the game predicate is on average at most 
1 — (5 -|- 2Cie, so for n independent such A, X, the probability that the sum of the n values of the 
game predicate is above {l — 5+a)n is at most exp[—2n(Q; —2G£e)^]. The second term in the r.h.s. of 
equation (ITOl) is obtained by simply using that e~^ >1 — x for any x > 0. 

The announced upper bound follows from choosing in equation (flO)) 


e 


{ACi - V2)a ^ 5(20 - V2)a 
SGf - 1 - 199Q 


so that I 


> 


a 


and removing the polynomial pre-factor by the same trick as before. □ 

Proof [Proof of Corollary|5l By LemmajSl we know that if G is an Gplayer game with full support 
satisfying uj^^{G) <1 — 6 , then ccsnos(G’) < 1 ~ '^/(r + !)• And thus by Theorem|4[ 


1 - 


,52 


‘^ns(G'"') < <^snos(G'’^) < 


5G|(r + l)2 
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The concentration bound for C(Jns(G'^/") follows analogously. □ 

Proof [Proof of Theorem[6l We follow the exact same reasoning as in the proof of TheoremjH and 
keep the same notation. In the case i = 2 , we have by Lemma [IT] that. for any 0 < e < 1, 

G T’. ^ 3 G NSUI^) : - Qax\\i < 5e. 

Yet, if the winning probability when playing G with a strategy G NS(^|£T) is, by assumption 
on G, at most 1 — <5, then the average of the predicate of G over Qax G Re is at most 1 — 6 + 5e. 
This is because we are here dealing with normalised probability distributions. Hence, for any 
0 < e < 1, 


oJNsiCn < poly(n) {{ 1-6 + 5er + (1 - e^T) , 

^SNOsiG^^"') < poly(n) (exp [-2n(Q; - 5e)^] + exp [-ne^]) . 

We can now choose e = — 5)6/2 in the parallel repetition estimate and e = (10 — -\/2)a/49 in 

the concentration bound one, and argue as in the proof of Theorem |4] to remove the polynomial 
pre-factor, which yields the two advertised results. □ 


VI. DISCUSSION 


Our main contribution in the present paper is a concentration result for the sub-no-signalling 
value of multi-player games under parallel repetition. In fact, we believe that our work is the 
first to recognize the intrinsic interest of the class of sub-no-signalling correlations, which appears 
naturally as a relaxation of the no-signalling ones. In particular, the fact that sub-no-signalling 
correlations have total probability less than or equal to 1 can be interpreted as the possibility of 
"abstaining" from giving an answer in ^ with a certain probability depending on the input in 
However, each marginal Pai\x has to be consistent "locally" with the no-signalling behaviour, 
in that it has to be dominated by a correlation Qai\Xi that depends only on the I positions of the 
input X. This means that each group I of players is able to interpret their observed statistics as 
being "really" governed by a local marginal QaiIx^/ only that sometimes the device generating 
the correlation defaults and does not give an answer. The probability of abstention depends on 
the entire input x, and thus would be signalling, if observed. Indeed, the anti-correlation game 
discussed in Example lll BI shows that a sub-no-signalling correlation cannot always be embedded 
in a no-signalling one, except in the case of two players (cf. Lemma |2]). Abstention thus gives 
more power in general, but it comes with a price as well, since abstaining does not mean winning 
the game. In this sense, it should not be confused with plain post-selection (that is, conditioning) 
on the non-abstaining event, which is well-known to allow the violation of Bell inequalities by 
otherwise local hidden variables, via the so-called "detection loophole "MM- 

Specifically, if an ^-player game G has SNOS value 1 — 6 , then the probability for SNOS players 
to win a fraction at least 1 — 6 + aoin instances of G played in parallel is at most exp(—nC^a^), 
where Q > 0 is a constant which only depends on the number I of players. This, a universal 
multi-player parallel repetition and concentration bound, is in contrast to the results on |@] and |j2], 
which are restricted to full-support game distributions and with constants that seem to depend 
heavily on the game. We think of these findings as evidence that sub-no-signallmg correlations 
are natural, due to their well-behaved parallel repetition properties. As hinted at in |@], such a 
result, valid for games involving strictly more than 2 players and where not all queries are asked 
10], might potentially find applications in position-based cryptography |0,E3]- It would also be 
interesting to investigate whether the recent work of showing multi-prover interactive proofs 
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for EXP (exponential time languages) that are robust against no-signalling provers, remains valid 
for sub-no-signalling provers, and whether our result can be generalized to amplify the soundness 
gap of their scheme. The latter is not self-evident, as they require a polynomial number of provers, 
but our bounds carry a penalty exponential in the number of players. 

In the case 1 = 2, our concentration statement is actually equivalent to the analogous one for 
the no-signalling value of G, fhus wifh a universal constant c = 6*2 in the exponential bound. And 
we know we carmot hope for a better dependence in a than the obtained one, even in the special 
case a = 6, as proved in 12711 . In the case i > 2, our result implies a concentration bound for fhe 
no-signalling value of G, but only if its input distribution has full support. Besides, the constant in 
the exponential bound is this time highly game-dependent (dependence on the sizes of fhe input 
and output alphabets, and on the smallest weight occurring in the input distribution). This is 
fully comparable fo previous work in this direction due to Buhrman, Fehr and Schaffner |@]/ 
Arnon-Friedman, Renner and Vidick |j2]. 

Hence, the most immediate open problem at that point is regarding games with non-full sup¬ 
port in the case of three or more players (e.g. the anti-correlation game): does a parallel repetition 
result hold for the no-signalling value of such mulfi-player games? Answering fhis question prob¬ 
ably requires to understand first whether in Corollary |5l the presence of the game parameter F 
is really necessary or is just an artifact of fhe proof fechnique. In other words, does the rate at 
which the no-signalling value of a game decays under parallel repetition truly depends on the 
game distribution? 

Another issue that would be worth investigating is whether constrained de Finetti reductions 
could also be used to establish parallel repetition results for fhe classical or quantum value of 
mulfi-player games. Formally, the sets of classical correlations C{A\^ and quantum correlations 
Q(A| A) are defined as follows: 


P G C(A|A) :<t4> V X,a, P(a|x) = Q{m)Pi{ai\xim) ■ ■ ■ Pi{ai\x(,m), 

m&M 


for some p.d. Q on some alphabet M and some p.d.'s Pi{-\xi m) on Ai. 

P e Q(=4lf^) Vx,a, P{a\x) = (V'|M(xi)ai ( 8 ) • • • ® M{xe)ai\'ip), 


for some pure state |'(/')(V'l on Hi (g) • • • < 8 ) and some POVMs M{xi) on H*. 

And the classical, resp. quantum, value of an ^-player game G with distribution T and predicate 
V, denoted a;c(G), resp. ojq{G), is then naturally defined as the maximum, resp. supremum, of 
the winning probability 


P(win) = ^r(x)A(a,x)P(o|x) 

a,x 


over all P G C(A| A), resp. P G Q(A| A). 

In the classical case, the first parallel repetition result for two-player games was established 
by Raz |34], and later improved by Holenstein while Rao gave a concentration bound. 
However, the proof techniques are arguably not as straightforward as via de Finetti reductions, 
and do not generalise directly to any number £ of players. In fhe quantum case, even less is known. 
The best parallel repetition result up to now is the one established by Chailloux and Scarpa |@] 
(subsequently improved by Chung, Wu and Yuen iflUl l. which applies to two-player (^-player) 
free games, and from there to games with full support. That is why being able to export ideas 
from the de Finetti approach to these two cases would be of great interest. Roughly speaking, the 
problem we are facing is the following: Given an n-symmetric correlation we can always 
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write the first step in the proof of Lemma|9l i.e. 



Now, we would like to argue that if is a classical, resp. quantum, correlation, then the 

p.d.'s Qax for which the fidelity weight in the r.h.s. of equation (ITT]) is not exponentially small 
are necessarily close to being of the form for some classical, resp. quantum, correlation 

RjX\x- This was precisely our proof philosophy in the no-signalling case. However, the fact that 
the classical and quantum conditions are not properties that one can read off on the marginals, 
contrary to the no-signalling one, seems to be a first obstacle to surmount. 

One related legitimate question would be the following: is it possible to make an even stronger 
statement than the one that, as explained above, we either are looking for (in the classical and 
quantum cases) or already have (in the no-signalling case)? Namely, could we upper bound 
by a de Finetti distribution analogous to that in the r.h.s. of equation dTT]) , but with 
weight strictly 0 on p.d.'s which are not of the form T^Rj^\xi for Ra\x belonging to the same 
class as Pjij^\x^'^ The answer to this question is no. Indeed, such improved de Finetti reduction 
would imply a strong parallel repetition result, which we know does not hold (see |0] for a similar 
discussion). So the best we can hope for is really to show that the fidelity weight in our upper 
bounding de Finetti distribution is exponentially small on the p.d.'s which are too far from being 
of the desired form. 

Finally, let us briefly comment on the main spirit difference between the present work and 
the one by Arnon-Friedman et al. |[3]. Our approach consists in using a more "flexible" de Finetti 
reduction, in which the information on the correlation Pjx^\x^ and the p.d. r|” of interest are kept 
in the upper bounding de Finetti distribution, through the fidelity weight . 

Whereas in iQ], any initial correlation is first upper bounded by the same universal de Finetti 
correlation, on which a test (specifically tailored to the considered game distribution) is performed 
in a second step, that has the property of letting pass, resp. rejecting, with high probability the 
strategies which are no-signalling, resp. too signalling. So it seems in the end that both approaches 
are quite closely related: in our case, the "signalling test" which is applied to a given p.d. Qax 
is nothing else than the maximal fidelity of Qax to the set of p.d.'s of the form T^Rji^\x/ with 
^A\x no-signalling, being above or below a certain threshold value. Also, it would be interesting 
(and potentially fruitful) to investigate whether one could combine in some way the techniques 
yielding Lemmas [7] and HI to get a de Finetti reduction result that would have the advantages 
of both: namely, that is designed for conditional p.d.'s while at the same carrying the relevant 
information on the conditional p.d. it is applied to. 
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